Morphdb.hu: Hungarian lexical database and morphological grammar
Abstract
This paper describes morphdb.hu, a Hungarian lexical database and morphological grammar. Morphdb.hu is the outcome of a several-year collaborative effort and represents the resource with the widest coverage and broadest range of applicability presently available for Hungarian. The grammar resource is the formalization of well-founded theoretical decisions handling inflection and productive derivation. The lexical database was created by merging three independent lexical databases, and the resulting resource was further
Similar resources
Benefits of Resource-Based Stemming in Hungarian Information Retrieval
This paper discusses the impact of resource-driven stemming in information retrieval tasks. We conducted experiments in order to identify the relative benefit of various stemming strategies in a language with highly complex morphology. The results reveal the importance of various aspects of stemming in enhancing system performance in the IR task of the CLEF ad-hoc monolingual Hungarian track. T...
A New Integrated Open-source Morphological Analyzer for Hungarian
The goal of a Hungarian research project has been to create an integrated Hungarian natural language processing framework. This infrastructure includes tools for analyzing Hungarian texts, integrated into a standardized environment. The morphological analyzer is one of the core components of the framework. The goal of this paper is to describe a fast and customizable morphological analyzer and ...
Creation and Development of the Romanian Lexical Resources
This article describes the Romanian lexical resources containing morphological data and dictionaries: synonyms, Romanian-English, and Romanian-Russian. The inflection process at the creation of morphological resources based on the functional grammar with scattered context is considered. An arbitrary word is inflected knowing only its part of speech, and the gender for nouns. New words were obtai...
The GeLexi MT Project
Our double Machine (-Aided) Translation project to be demonstrated here (sections 3-4) is based on the parsing project [5-9, 12] of our Pécs theoretical and computational linguistics research team, called GeLexi (‘Generative LEXIcon’), whose (knowledge-based) parsers (section 2) rely on “totally lexicalist” [21, 2, 4, 7] representative mini-grammars of English and the (morphologically very rich...
On Hungarian Morphology
The aim of this study is to provide an autosegmental description of Hungarian morphology. Chapter 1 sketches the (meta)theoretical background and summarizes the main argument. In Chapter 2 phonological prerequisites to morphological analysis are discussed. Special attention is paid to Hungarian vowel harmony. In Chapter 3 a universal theory of lexical categories is proposed, and the category sy...